Overview
Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 34603 |
| Missing cells | 13118 |
| Missing cells (%) | 1.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 7.7 MiB |
| Average record size in memory | 232.0 B |
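The missing-cell percentage can be cross-checked against the other figures in this table. The sketch below assumes the percentage is simply missing cells divided by (variables × observations); the helper name `missing_cell_share` is illustrative and not part of ydata-profiling.

```python
def missing_cell_share(missing_cells: int, n_variables: int, n_observations: int) -> float:
    """Return the fraction of cells that are missing in a rectangular dataset."""
    total_cells = n_variables * n_observations
    return missing_cells / total_cells


# Figures copied from the Dataset statistics table.
share = missing_cell_share(missing_cells=13118, n_variables=29, n_observations=34603)
print(f"{share:.1%}")
```

Under that assumption the result rounds to the 1.3% shown in the table.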
Variable types
| Text | 10 |
|---|---|
| Categorical | 9 |
| Numeric | 8 |
| DateTime | 2 |
db_code_commune is highly overall correlated with db_code_dpt and 1 other fields | High correlation |
db_code_dpt is highly overall correlated with db_code_commune and 1 other fields | High correlation |
iso_pays is highly overall correlated with db_code_commune and 1 other fields | High correlation |
db_continent is highly imbalanced (92.4%) | Imbalance |
dd_continent is highly imbalanced (93.1%) | Imbalance |
iso_date is highly imbalanced (76.0%) | Imbalance |
db_lib_commune has 481 (1.4%) missing values | Missing |
db_dept_isocode_3166 has 5472 (15.8%) missing values | Missing |
dd_lib_commune has 691 (2.0%) missing values | Missing |
dd_dept_isocode_3166 has 6334 (18.3%) missing values | Missing |
age has 12083 (34.9%) zeros | Zeros |
distance has 12051 (34.8%) zeros | Zeros |
Reproduction
| Analysis started | 2025-05-31 10:10:35.573687 |
|---|---|
| Analysis finished | 2025-05-31 10:10:53.837471 |
| Duration | 18.26 seconds |
| Software version | ydata-profiling v4.16.1 |
| Download configuration | config.json |
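The reported duration follows from the two timestamps. A small sketch using only the standard library (variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2025-05-31 10:10:35.573687")
finished = datetime.fromisoformat("2025-05-31 10:10:53.837471")

# Subtracting two datetimes gives a timedelta; convert it to seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```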
Variables
nom
Text
| Distinct | 22640 |
|---|---|
| Distinct (%) | 65.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
Length
| Max length | 38 |
|---|---|
| Median length | 29 |
| Mean length | 7.0940959 |
| Min length | 2 |
Unique
| Unique | 18048 |
|---|---|
| Unique (%) | 52.2% |
Sample
| 1st row | LETERME |
|---|---|
| 2nd row | DEROUT |
| 3rd row | GRAS Y PLASSARD |
| 4th row | LENOIR |
| 5th row | LESZCZUK |
| Value | Count | Frequency (%) |
| le | 364 | 1.0% |
| de | 192 | 0.5% |
| martin | 102 | 0.3% |
| simon | 54 | 0.2% |
| robert | 53 | 0.1% |
| durand | 50 | 0.1% |
| bernard | 50 | 0.1% |
| thomas | 49 | 0.1% |
| richard | 48 | 0.1% |
| laurent | 48 | 0.1% |
| Other values (22591) | 34807 | 97.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 29772 | 12.1% |
| A | 24314 | 9.9% |
| R | 21706 | 8.8% |
| I | 17138 | 7.0% |
| O | 16412 | 6.7% |
| L | 15965 | 6.5% |
| N | 15908 | 6.5% |
| U | 12845 | 5.2% |
| T | 11917 | 4.9% |
| S | 10886 | 4.4% |
| Other values (19) | 68614 | 28.0% |
prenom
Text
| Distinct | 20813 |
|---|---|
| Distinct (%) | 60.4% |
| Missing | 138 |
| Missing (%) | 0.4% |
| Memory size | 270.5 KiB |
Length
| Max length | 50 |
|---|---|
| Median length | 40 |
| Mean length | 13.850689 |
| Min length | 1 |
Unique
| Unique | 18030 |
|---|---|
| Unique (%) | 52.3% |
Sample
| 1st row | PATRICE,EUGENE,OMER |
|---|---|
| 2nd row | JEAN,DANIEL |
| 3rd row | BORIS,GUY |
| 4th row | VALERIE,PAULE |
| 5th row | MARIE-AGNES |
| Value | Count | Frequency (%) |
| jean | 165 | 0.5% |
| david | 148 | 0.4% |
| marie | 143 | 0.4% |
| joseph | 130 | 0.4% |
| michel | 124 | 0.4% |
| nathalie | 118 | 0.3% |
| alain | 108 | 0.3% |
| christophe | 104 | 0.3% |
| philippe | 102 | 0.3% |
| laurent | 100 | 0.3% |
| Other values (20805) | 33255 | 96.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 75659 | 15.8% |
| A | 48033 | 10.1% |
| I | 42253 | 8.9% |
| R | 39327 | 8.2% |
| N | 36251 | 7.6% |
| , | 34079 | 7.1% |
| L | 28244 | 5.9% |
| C | 18815 | 3.9% |
| S | 17724 | 3.7% |
| O | 16528 | 3.5% |
| Other values (20) | 120451 | 25.2% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | M |
| 3rd row | M |
| 4th row | F |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| M | 21164 | 61.2% |
| F | 13439 | 38.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 21164 | 61.2% |
| F | 13439 | 38.8% |
age
Real number (ℝ)
Zeros 
| Distinct | 97 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.350779 |
| Minimum | 0 |
|---|---|
| Maximum | 97 |
| Zeros | 12083 |
| Zeros (%) | 34.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 270.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 12 |
| Q3 | 23 |
| 95-th percentile | 70 |
| Maximum | 97 |
| Range | 97 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 22.508483 |
|---|---|
| Coefficient of variation (CV) | 1.2265683 |
| Kurtosis | 0.5701977 |
| Mean | 18.350779 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 1.2670973 |
| Sum | 634992 |
| Variance | 506.63182 |
| Monotonicity | Not monotonic |
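Several of the descriptive statistics for `age` are derived from one another. The sketch below recomputes them from the reported mean, standard deviation and quantiles, assuming the usual definitions (coefficient of variation = standard deviation / mean, variance = standard deviation squared, IQR = Q3 − Q1, range = maximum − minimum); the variable names are illustrative.

```python
# Values copied from the statistics tables for `age`.
mean = 18.350779
std = 22.508483
q1, q3 = 0, 23
minimum, maximum = 0, 97

cv = std / mean                    # coefficient of variation
variance = std ** 2                # variance is the squared standard deviation
iqr = q3 - q1                      # interquartile range
value_range = maximum - minimum    # range

print(f"CV={cv:.7f} variance={variance:.5f} IQR={iqr} range={value_range}")
```

A coefficient of variation above 1 is consistent with the heavy concentration of zeros (34.9%) flagged for this variable.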
| Value | Count | Frequency (%) |
| 0 | 12083 | 34.9% |
| 1 | 1176 | 3.4% |
| 21 | 1030 | 3.0% |
| 19 | 1015 | 2.9% |
| 22 | 988 | 2.9% |
| 18 | 984 | 2.8% |
| 20 | 970 | 2.8% |
| 23 | 923 | 2.7% |
| 17 | 886 | 2.6% |
| 16 | 691 | 2.0% |
| Other values (87) | 13857 | 40.0% |
| Value | Count | Frequency (%) |
| 0 | 12083 | 34.9% |
| 1 | 1176 | 3.4% |
| 2 | 664 | 1.9% |
| 3 | 531 | 1.5% |
| 4 | 464 | 1.3% |
| 5 | 404 | 1.2% |
| 6 | 374 | 1.1% |
| 7 | 336 | 1.0% |
| 8 | 302 | 0.9% |
| 9 | 261 | 0.8% |
| Value | Count | Frequency (%) |
| 97 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 94 | 4 | < 0.1% |
| 93 | 2 | < 0.1% |
| 92 | 1 | < 0.1% |
| 91 | 5 | < 0.1% |
| 90 | 9 | < 0.1% |
| 89 | 8 | < 0.1% |
| 88 | 12 | < 0.1% |
| 87 | 6 | < 0.1% |
long_nom
Real number (ℝ)
| Distinct | 28 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.0940959 |
| Minimum | 2 |
|---|---|
| Maximum | 38 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 270.5 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 6 |
| median | 7 |
| Q3 | 8 |
| 95-th percentile | 11 |
| Maximum | 38 |
| Range | 36 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.1466899 |
|---|---|
| Coefficient of variation (CV) | 0.30260233 |
| Kurtosis | 9.6523279 |
| Mean | 7.0940959 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.8880713 |
| Sum | 245477 |
| Variance | 4.6082777 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 7690 | 22.2% |
| 6 | 7617 | 22.0% |
| 8 | 5382 | 15.6% |
| 5 | 5074 | 14.7% |
| 9 | 3320 | 9.6% |
| 4 | 1773 | 5.1% |
| 10 | 1627 | 4.7% |
| 11 | 794 | 2.3% |
| 12 | 339 | 1.0% |
| 3 | 300 | 0.9% |
| Other values (18) | 687 | 2.0% |
| Value | Count | Frequency (%) |
| 2 | 9 | < 0.1% |
| 3 | 300 | 0.9% |
| 4 | 1773 | 5.1% |
| 5 | 5074 | 14.7% |
| 6 | 7617 | 22.0% |
| 7 | 7690 | 22.2% |
| 8 | 5382 | 15.6% |
| 9 | 3320 | 9.6% |
| 10 | 1627 | 4.7% |
| 11 | 794 | 2.3% |
| Value | Count | Frequency (%) |
| 38 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 29 | 3 | < 0.1% |
| 27 | 4 | < 0.1% |
| 26 | 4 | < 0.1% |
| 24 | 5 | < 0.1% |
| 23 | 4 | < 0.1% |
| 22 | 14 | < 0.1% |
| 21 | 12 | < 0.1% |
| 20 | 31 | 0.1% |
nbre_prenoms
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.9848568 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 270.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 3 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 0.89894686 |
|---|---|
| Coefficient of variation (CV) | 0.45290263 |
| Kurtosis | -0.6588577 |
| Mean | 1.9848568 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.43765442 |
| Sum | 68682 |
| Variance | 0.80810546 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 12676 | 36.6% |
| 2 | 11225 | 32.4% |
| 3 | 9359 | 27.0% |
| 4 | 1247 | 3.6% |
| 5 | 88 | 0.3% |
| 6 | 5 | < 0.1% |
| 7 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 12676 | 36.6% |
| 2 | 11225 | 32.4% |
| 3 | 9359 | 27.0% |
| 4 | 1247 | 3.6% |
| 5 | 88 | 0.3% |
| 6 | 5 | < 0.1% |
| 7 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 3 | < 0.1% |
| 6 | 5 | < 0.1% |
| 5 | 88 | 0.3% |
| 4 | 1247 | 3.6% |
| 3 | 9359 | 27.0% |
| 2 | 11225 | 32.4% |
| 1 | 12676 | 36.6% |
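Because `nbre_prenoms` has only seven distinct values, its frequency table is enough to recover the observation count, the sum and the mean reported above. A minimal sketch, with illustrative variable names:

```python
# Value -> count, copied from the nbre_prenoms frequency table.
counts = {1: 12676, 2: 11225, 3: 9359, 4: 1247, 5: 88, 6: 5, 7: 3}

n_observations = sum(counts.values())
total = sum(value * count for value, count in counts.items())
mean = total / n_observations

# Relative frequency of each value, as a fraction of all observations.
frequencies = {value: count / n_observations for value, count in counts.items()}

print(n_observations, total, f"{mean:.7f}")
```

The counts add up to the 34603 observations of the dataset and the weighted sum matches the reported Sum of 68682.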
db_date
Date
| Distinct | 13985 |
|---|---|
| Distinct (%) | 40.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
| Minimum | 1848-07-15 00:00:00 |
|---|---|
| Maximum | 1970-12-30 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
db_lib_jour
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.4117562 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Lundi |
|---|---|
| 2nd row | Dimanche |
| 3rd row | Mardi |
| 4th row | Samedi |
| 5th row | Samedi |
Common Values
| Value | Count | Frequency (%) |
| Lundi | 5104 | 14.8% |
| Mardi | 4999 | 14.4% |
| Vendredi | 4985 | 14.4% |
| Jeudi | 4931 | 14.3% |
| Samedi | 4928 | 14.2% |
| Mercredi | 4918 | 14.2% |
| Dimanche | 4738 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| d | 34850 | 15.7% |
| i | 34603 | 15.6% |
| e | 34403 | 15.5% |
| r | 19820 | 8.9% |
| n | 14827 | 6.7% |
| a | 14665 | 6.6% |
| u | 10035 | 4.5% |
| M | 9917 | 4.5% |
| m | 9666 | 4.4% |
| c | 9656 | 4.4% |
| Other values (6) | 29424 | 13.3% |
db_week
Real number (ℝ)
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.909343 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 270.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 12 |
| median | 25 |
| Q3 | 39 |
| 95-th percentile | 50 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 15.341462 |
|---|---|
| Coefficient of variation (CV) | 0.59212084 |
| Kurtosis | -1.218183 |
| Mean | 25.909343 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.061281186 |
| Sum | 896541 |
| Variance | 235.36046 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40 | 961 | 2.8% |
| 1 | 950 | 2.7% |
| 52 | 808 | 2.3% |
| 9 | 772 | 2.2% |
| 2 | 740 | 2.1% |
| 10 | 725 | 2.1% |
| 20 | 721 | 2.1% |
| 13 | 718 | 2.1% |
| 7 | 711 | 2.1% |
| 6 | 708 | 2.0% |
| Other values (43) | 26789 | 77.4% |
| Value | Count | Frequency (%) |
| 1 | 950 | 2.7% |
| 2 | 740 | 2.1% |
| 3 | 685 | 2.0% |
| 4 | 682 | 2.0% |
| 5 | 665 | 1.9% |
| 6 | 708 | 2.0% |
| 7 | 711 | 2.1% |
| 8 | 681 | 2.0% |
| 9 | 772 | 2.2% |
| 10 | 725 | 2.1% |
| Value | Count | Frequency (%) |
| 53 | 181 | 0.5% |
| 52 | 808 | 2.3% |
| 51 | 623 | 1.8% |
| 50 | 638 | 1.8% |
| 49 | 642 | 1.9% |
| 48 | 637 | 1.8% |
| 47 | 574 | 1.7% |
| 46 | 589 | 1.7% |
| 45 | 590 | 1.7% |
| 44 | 570 | 1.6% |
db_code_commune
Real number (ℝ)
High correlation 
| Distinct | 6300 |
|---|---|
| Distinct (%) | 18.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57799.95 |
| Minimum | 1004 |
|---|---|
| Maximum | 99501 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 270.5 KiB |
Quantile statistics
| Minimum | 1004 |
|---|---|
| 5-th percentile | 10033 |
| Q1 | 33128 |
| median | 59606 |
| Q3 | 78586 |
| 95-th percentile | 99122 |
| Maximum | 99501 |
| Range | 98497 |
| Interquartile range (IQR) | 45458 |
Descriptive statistics
| Standard deviation | 28932.605 |
|---|---|
| Coefficient of variation (CV) | 0.50056454 |
| Kurtosis | -1.0674234 |
| Mean | 57799.95 |
| Median Absolute Deviation (MAD) | 23455 |
| Skewness | -0.20408177 |
| Sum | 2.0000517 × 10^9 |
| Variance | 8.3709564 × 10^8 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 99122 | 2057 | 5.9% |
| 99109 | 529 | 1.5% |
| 13055 | 478 | 1.4% |
| 99352 | 406 | 1.2% |
| 75114 | 332 | 1.0% |
| 99123 | 324 | 0.9% |
| 44109 | 307 | 0.9% |
| 59350 | 288 | 0.8% |
| 31555 | 275 | 0.8% |
| 33063 | 232 | 0.7% |
| Other values (6290) | 29375 | 84.9% |
| Value | Count | Frequency (%) |
| 1004 | 6 | < 0.1% |
| 1005 | 1 | < 0.1% |
| 1017 | 1 | < 0.1% |
| 1026 | 1 | < 0.1% |
| 1029 | 1 | < 0.1% |
| 1032 | 2 | < 0.1% |
| 1033 | 2 | < 0.1% |
| 1034 | 7 | < 0.1% |
| 1035 | 1 | < 0.1% |
| 1036 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 99501 | 2 | < 0.1% |
| 99431 | 3 | < 0.1% |
| 99424 | 3 | < 0.1% |
| 99417 | 4 | < 0.1% |
| 99416 | 4 | < 0.1% |
| 99415 | 1 | < 0.1% |
| 99408 | 1 | < 0.1% |
| 99407 | 1 | < 0.1% |
| 99405 | 4 | < 0.1% |
| 99404 | 12 | < 0.1% |
db_lib_commune
Text
Missing 
| Distinct | 7862 |
|---|---|
| Distinct (%) | 23.0% |
| Missing | 481 |
| Missing (%) | 1.4% |
| Memory size | 270.5 KiB |
Length
| Max length | 30 |
|---|---|
| Median length | 27 |
| Mean length | 9.380898 |
| Min length | 2 |
Unique
| Unique | 5258 |
|---|---|
| Unique (%) | 15.4% |
Sample
| 1st row | BOITRON |
|---|---|
| 2nd row | BRIE-COMTE-ROBERT |
| 3rd row | BROU-SUR-CHANTEREINE |
| 4th row | BROU-SUR-CHANTEREINE |
| 5th row | CERNEUX |
| Value | Count | Frequency (%) |
| paris | 1708 | 4.6% |
| la | 705 | 1.9% |
| le | 683 | 1.8% |
| lyon | 517 | 1.4% |
| marseille | 479 | 1.3% |
| varsovie | 465 | 1.2% |
| nantes | 307 | 0.8% |
| lille | 288 | 0.8% |
| saint | 281 | 0.8% |
| toulouse | 275 | 0.7% |
| Other values (8027) | 31735 | 84.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 39520 | 12.3% |
| A | 30141 | 9.4% |
| N | 27083 | 8.5% |
| R | 24298 | 7.6% |
| S | 23329 | 7.3% |
| I | 22829 | 7.1% |
| L | 22082 | 6.9% |
| O | 19295 | 6.0% |
| U | 14593 | 4.6% |
| T | 13607 | 4.3% |
| Other values (31) | 83318 | 26.0% |
db_code_dpt
Real number (ℝ)
High correlation 
| Distinct | 98 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57.586452 |
| Minimum | 1 |
|---|---|
| Maximum | 99 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 270.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 10 |
| Q1 | 33 |
| median | 59 |
| Q3 | 78 |
| 95-th percentile | 99 |
| Maximum | 99 |
| Range | 98 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 28.937175 |
|---|---|
| Coefficient of variation (CV) | 0.5024997 |
| Kurtosis | -1.066841 |
| Mean | 57.586452 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | -0.20121855 |
| Sum | 1992664 |
| Variance | 837.3601 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 99 | 4718 | 13.6% |
| 75 | 2248 | 6.5% |
| 59 | 1742 | 5.0% |
| 62 | 899 | 2.6% |
| 19 | 894 | 2.6% |
| 69 | 760 | 2.2% |
| 13 | 732 | 2.1% |
| 57 | 671 | 1.9% |
| 76 | 666 | 1.9% |
| 44 | 653 | 1.9% |
| Other values (88) | 20620 | 59.6% |
| Value | Count | Frequency (%) |
| 1 | 277 | 0.8% |
| 2 | 357 | 1.0% |
| 3 | 188 | 0.5% |
| 4 | 40 | 0.1% |
| 5 | 87 | 0.3% |
| 6 | 234 | 0.7% |
| 7 | 246 | 0.7% |
| 8 | 217 | 0.6% |
| 9 | 70 | 0.2% |
| 10 | 174 | 0.5% |
| Value | Count | Frequency (%) |
| 99 | 4718 | 13.6% |
| 98 | 141 | 0.4% |
| 97 | 484 | 1.4% |
| 95 | 99 | 0.3% |
| 94 | 85 | 0.2% |
| 93 | 147 | 0.4% |
| 92 | 162 | 0.5% |
| 91 | 72 | 0.2% |
| 90 | 53 | 0.2% |
| 89 | 197 | 0.6% |
db_dept_isocode_3166
Text
Missing 
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 5472 |
| Missing (%) | 15.8% |
| Memory size | 270.5 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FR-77 |
|---|---|
| 2nd row | FR-77 |
| 3rd row | FR-77 |
| 4th row | FR-77 |
| 5th row | FR-77 |
| Value | Count | Frequency (%) |
| fr-75 | 2248 | 7.7% |
| fr-59 | 1742 | 6.0% |
| fr-62 | 899 | 3.1% |
| fr-19 | 894 | 3.1% |
| fr-69 | 760 | 2.6% |
| fr-13 | 732 | 2.5% |
| fr-57 | 671 | 2.3% |
| fr-76 | 666 | 2.3% |
| fr-44 | 653 | 2.2% |
| fr-33 | 610 | 2.1% |
| Other values (84) | 19256 | 66.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 29131 | 20.0% |
| R | 29131 | 20.0% |
| - | 29131 | 20.0% |
| 5 | 8461 | 5.8% |
| 7 | 7872 | 5.4% |
| 3 | 6067 | 4.2% |
| 6 | 5990 | 4.1% |
| 9 | 5518 | 3.8% |
| 1 | 5494 | 3.8% |
| 2 | 5442 | 3.7% |
| Other values (3) | 13418 | 9.2% |
db_pays
Text
| Distinct | 82 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
Length
| Max length | 27 |
|---|---|
| Median length | 6 |
| Mean length | 6.2376384 |
| Min length | 4 |
Unique
| Unique | 22 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | FRANCE |
|---|---|
| 2nd row | FRANCE |
| 3rd row | FRANCE |
| 4th row | FRANCE |
| 5th row | FRANCE |
| Value | Count | Frequency (%) |
| france | 29263 | 83.9% |
| pologne | 2057 | 5.9% |
| allemagne | 529 | 1.5% |
| algerie | 406 | 1.2% |
| russie | 324 | 0.9% |
| la | 232 | 0.7% |
| réunion | 232 | 0.7% |
| roumanie | 219 | 0.6% |
| turquie | 152 | 0.4% |
| guadeloupe | 148 | 0.4% |
| Other values (84) | 1316 | 3.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 35659 | 16.5% |
| N | 33046 | 15.3% |
| A | 32196 | 14.9% |
| R | 31172 | 14.4% |
| C | 29604 | 13.7% |
| F | 29276 | 13.6% |
| O | 5071 | 2.3% |
| L | 4246 | 2.0% |
| G | 3603 | 1.7% |
| I | 2461 | 1.1% |
| Other values (22) | 9507 | 4.4% |
db_continent
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.0056353 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EUROPE |
|---|---|
| 2nd row | EUROPE |
| 3rd row | EUROPE |
| 4th row | EUROPE |
| 5th row | EUROPE |
Common Values
| Value | Count | Frequency (%) |
| EUROPE | 33840 | 97.8% |
| AFRIQUE | 521 | 1.5% |
| ASIE | 202 | 0.6% |
| AMERIQUE | 38 | 0.1% |
| OCEANIE | 2 | < 0.1% |
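The imbalance alert for `db_continent` can be approximated from these counts. The sketch below assumes the score is one minus the Shannon entropy of the class distribution, normalised by the maximum entropy for that number of classes; this matches my understanding of how ydata-profiling scores imbalance, but treat the formula and the helper name `imbalance_score` as assumptions rather than the library's documented API.

```python
import math


def imbalance_score(class_counts):
    """Return 1 - H/H_max, where H is the Shannon entropy (base 2) of the classes.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    n_classes = len(class_counts)
    if n_classes < 2:
        # Assumption: a single class is treated as "no imbalance" to avoid dividing by zero.
        return 0.0
    total = sum(class_counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in class_counts if c > 0)
    return 1 - entropy / math.log2(n_classes)


# Counts copied from the Common Values table for db_continent.
continent_counts = [33840, 521, 202, 38, 2]
score = imbalance_score(continent_counts)
print(f"{score:.1%}")
```

Under that assumption the score comes out at roughly 92%, in line with the 92.4% reported in the alert.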
Most occurring characters
| Value | Count | Frequency (%) |
| E | 68483 | 33.0% |
| U | 34399 | 16.6% |
| R | 34399 | 16.6% |
| O | 33842 | 16.3% |
| P | 33840 | 16.3% |
| A | 763 | 0.4% |
| I | 763 | 0.4% |
| Q | 559 | 0.3% |
| F | 521 | 0.3% |
| S | 202 | 0.1% |
| Other values (3) | 42 | < 0.1% |
dd_date
Date
| Distinct | 3339 |
|---|---|
| Distinct (%) | 9.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
| Minimum | 1882-10-01 00:00:00 |
|---|---|
| Maximum | 1970-12-30 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
dd_lib_jour
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.4245875 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mardi |
|---|---|
| 2nd row | Samedi |
| 3rd row | Vendredi |
| 4th row | Samedi |
| 5th row | Lundi |
Common Values
| Value | Count | Frequency (%) |
| Samedi | 5297 | 15.3% |
| Lundi | 5210 | 15.1% |
| Mercredi | 5152 | 14.9% |
| Dimanche | 4995 | 14.4% |
| Mardi | 4807 | 13.9% |
| Jeudi | 4623 | 13.4% |
| Vendredi | 4519 | 13.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 34603 | 15.6% |
| e | 34257 | 15.4% |
| d | 34127 | 15.4% |
| r | 19630 | 8.8% |
| a | 15099 | 6.8% |
| n | 14724 | 6.6% |
| m | 10292 | 4.6% |
| c | 10147 | 4.6% |
| M | 9959 | 4.5% |
| u | 9833 | 4.4% |
| Other values (6) | 29639 | 13.3% |
dd_week
Real number (ℝ)
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.632055 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 270.5 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 14 |
| median | 29 |
| Q3 | 40 |
| 95-th percentile | 51 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 15.043129 |
|---|---|
| Coefficient of variation (CV) | 0.54440864 |
| Kurtosis | -1.1771503 |
| Mean | 27.632055 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.082183525 |
| Sum | 956152 |
| Variance | 226.29574 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 32 | 1065 | 3.1% |
| 31 | 979 | 2.8% |
| 33 | 894 | 2.6% |
| 35 | 798 | 2.3% |
| 46 | 774 | 2.2% |
| 40 | 743 | 2.1% |
| 52 | 737 | 2.1% |
| 49 | 728 | 2.1% |
| 30 | 726 | 2.1% |
| 6 | 722 | 2.1% |
| Other values (43) | 26437 | 76.4% |
| Value | Count | Frequency (%) |
| 1 | 428 | 1.2% |
| 2 | 619 | 1.8% |
| 3 | 582 | 1.7% |
| 4 | 659 | 1.9% |
| 5 | 600 | 1.7% |
| 6 | 722 | 2.1% |
| 7 | 708 | 2.0% |
| 8 | 564 | 1.6% |
| 9 | 620 | 1.8% |
| 10 | 713 | 2.1% |
| Value | Count | Frequency (%) |
| 53 | 395 | 1.1% |
| 52 | 737 | 2.1% |
| 51 | 712 | 2.1% |
| 50 | 659 | 1.9% |
| 49 | 728 | 2.1% |
| 48 | 590 | 1.7% |
| 47 | 680 | 2.0% |
| 46 | 774 | 2.2% |
| 45 | 632 | 1.8% |
| 44 | 720 | 2.1% |
dd_code_commune
Text
| Distinct | 7462 |
|---|---|
| Distinct (%) | 21.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 4938 |
|---|---|
| Unique (%) | 14.3% |
Sample
| 1st row | 75114 |
|---|---|
| 2nd row | 89387 |
| 3rd row | 77243 |
| 4th row | 77055 |
| 5th row | 77305 |
| Value | Count | Frequency (%) |
| 99122 | 3597 | 10.5% |
| 13055 | 677 | 2.0% |
| 59350 | 617 | 1.8% |
| 54395 | 480 | 1.4% |
| 33063 | 454 | 1.3% |
| 75114 | 453 | 1.3% |
| 44109 | 380 | 1.1% |
| 69383 | 342 | 1.0% |
| 31555 | 318 | 0.9% |
| 75115 | 299 | 0.9% |
| Other values (7451) | 26549 | 77.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 25815 | |
| 2 | 22402 | |
| 9 | 21596 | |
| 0 | 18384 | |
| 5 | 17608 | |
| 3 | 17245 | |
| 4 | 13638 | |
| 6 | 11863 | |
| 7 | 11773 | |
| 8 | 10502 | |
| Other values (3) | 2189 | 1.3% |
dd_lib_commune
Text
Missing 
| Distinct | 7133 |
|---|---|
| Distinct (%) | 21.0% |
| Missing | 691 |
| Missing (%) | 2.0% |
| Memory size | 270.5 KiB |
Length
| Max length | 43 |
|---|---|
| Median length | 34 |
| Mean length | 10.505957 |
| Min length | 2 |
Unique
| Unique | 4601 |
|---|---|
| Unique (%) | 13.6% |
Sample
| 1st row | PARIS-14E-ARRONDISSEMENT |
|---|---|
| 2nd row | SENS |
| 3rd row | LAGNY-SUR-MARNE |
| 4th row | BROU-SUR-CHANTEREINE |
| 5th row | MONTEREAU-FAULT-YONNE |
| Value | Count | Frequency (%) |
| varsovie | 3597 | 10.1% |
| le | 783 | 2.2% |
| marseille | 688 | 1.9% |
| la | 678 | 1.9% |
| lille | 617 | 1.7% |
| nancy | 480 | 1.3% |
| bordeaux | 454 | 1.3% |
| paris-14e-arrondissement | 453 | 1.3% |
| nantes | 380 | 1.1% |
| lyon--3e--arrondissement | 342 | 1.0% |
| Other values (7122) | 27134 | 76.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 45929 | 12.9% |
| A | 31041 | 8.7% |
| R | 30213 | 8.5% |
| S | 29134 | 8.2% |
| N | 28931 | 8.1% |
| I | 25491 | 7.2% |
| O | 21769 | 6.1% |
| L | 21047 | 5.9% |
| - | 18675 | 5.2% |
| T | 14165 | 4.0% |
| Other values (61) | 89883 | 25.2% |
dd_code_dpt
Text
| Distinct | 102 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 75 |
|---|---|
| 2nd row | 89 |
| 3rd row | 77 |
| 4th row | 77 |
| 5th row | 77 |
| Value | Count | Frequency (%) |
| 99 | 5134 | 15.0% |
| 75 | 1790 | 5.2% |
| 59 | 1689 | 4.9% |
| 69 | 1028 | 3.0% |
| 13 | 1019 | 3.0% |
| 62 | 758 | 2.2% |
| 33 | 722 | 2.1% |
| 44 | 690 | 2.0% |
| 54 | 676 | 2.0% |
| 76 | 615 | 1.8% |
| Other values (91) | 20045 | 58.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 16804 | 24.3% |
| 5 | 7923 | 11.4% |
| 7 | 7455 | 10.8% |
| 3 | 6794 | 9.8% |
| 6 | 5987 | 8.7% |
| 4 | 5838 | 8.4% |
| 1 | 5307 | 7.7% |
| 2 | 4906 | 7.1% |
| 8 | 4169 | 6.0% |
| 0 | 3145 | 4.5% |
| Other values (3) | 878 | 1.3% |
dd_dept_isocode_3166
Text
Missing
| Distinct | 96 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 6334 |
| Missing (%) | 18.3% |
| Memory size | 270.5 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | FR-75 |
|---|---|
| 2nd row | FR-89 |
| 3rd row | FR-77 |
| 4th row | FR-77 |
| 5th row | FR-77 |
| Value | Count | Frequency (%) |
| fr-75 | 1790 | 6.3% |
| fr-59 | 1689 | 6.0% |
| fr-69 | 1028 | 3.6% |
| fr-13 | 1019 | 3.6% |
| fr-62 | 758 | 2.7% |
| fr-33 | 722 | 2.6% |
| fr-44 | 690 | 2.4% |
| fr-54 | 676 | 2.4% |
| fr-76 | 615 | 2.2% |
| fr-57 | 536 | 1.9% |
| Other values (86) | 18746 | 66.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 28269 | 20.0% |
| R | 28269 | 20.0% |
| - | 28269 | 20.0% |
| 5 | 7923 | 5.6% |
| 7 | 6845 | 4.8% |
| 3 | 6794 | 4.8% |
| 6 | 5987 | 4.2% |
| 9 | 5904 | 4.2% |
| 4 | 5838 | 4.1% |
| 1 | 5307 | 3.8% |
| Other values (5) | 11940 | 8.4% |
dd_pays
Text
| Distinct | 83 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 270.5 KiB |
Length
| Max length | 33 |
|---|---|
| Median length | 6 |
| Mean length | 6.1927634 |
| Min length | 4 |
Unique
| Unique | 20 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | FRANCE |
|---|---|
| 2nd row | FRANCE |
| 3rd row | FRANCE |
| 4th row | FRANCE |
| 5th row | FRANCE |
| Value | Count | Frequency (%) |
| france | 29469 | 84.9% |
| pologne | 3597 | 10.4% |
| allemagne | 279 | 0.8% |
| algerie | 229 | 0.7% |
| belgique | 109 | 0.3% |
| suisse | 103 | 0.3% |
| espagne | 77 | 0.2% |
| autriche | 65 | 0.2% |
| maroc | 51 | 0.1% |
| italie | 49 | 0.1% |
| Other values (83) | 675 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 35284 | 16.5% |
| N | 33819 | 15.8% |
| A | 31097 | 14.5% |
| R | 30097 | 14.0% |
| C | 29768 | 13.9% |
| F | 29492 | 13.8% |
| O | 7485 | 3.5% |
| L | 4800 | 2.2% |
| G | 4413 | 2.1% |
| P | 3718 | 1.7% |
| Other values (21) | 4309 | 2.0% |
dd_continent
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 270.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 6 |
| Mean length | 6.0154615 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EUROPE |
|---|---|
| 2nd row | EUROPE |
| 3rd row | EUROPE |
| 4th row | EUROPE |
| 5th row | EUROPE |
Common Values
| Value | Count | Frequency (%) |
| EUROPE | 33948 | 98.1% |
| AFRIQUE | 417 | 1.2% |
| AMERIQUE | 139 | 0.4% |
| ASIE | 86 | 0.2% |
| OCEANIE | 12 | < 0.1% |
| (Missing) | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 68701 | 33.0% |
| U | 34504 | 16.6% |
| R | 34504 | 16.6% |
| O | 33960 | 16.3% |
| P | 33948 | 16.3% |
| A | 654 | 0.3% |
| I | 654 | 0.3% |
| Q | 556 | 0.3% |
| F | 417 | 0.2% |
| M | 139 | 0.1% |
| Other values (3) | 110 | 0.1% |
iso_date
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 33235 | 96.0% |
| 1 | 1368 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 33235 | 96.0% |
| 1 | 1368 | 4.0% |
iso_dpt
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 24315 | 70.3% |
| 0 | 10288 | 29.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 24315 | 70.3% |
| 0 | 10288 | 29.7% |
iso_commune
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 22658 | 65.5% |
| 1 | 11945 | 34.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 22658 | 65.5% |
| 1 | 11945 | 34.5% |
iso_pays
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 270.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 30527 | 88.2% |
| 0 | 4076 | 11.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 30527 | 88.2% |
| 0 | 4076 | 11.8% |
distance
Real number (ℝ)
Zeros 
| Distinct | 1452 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4206636.6 |
| Minimum | 0 |
|---|---|
| Maximum | 1.2345679 × 10⁸ |
| Zeros | 12051 |
| Zeros (%) | 34.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 270.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 11 |
| Q3 | 94.5 |
| 95-th percentile | 1764.1 |
| Maximum | 1.2345679 × 10⁸ |
| Range | 1.2345679 × 10⁸ |
| Interquartile range (IQR) | 94.5 |
Descriptive statistics
| Standard deviation | 22397158 |
|---|---|
| Coefficient of variation (CV) | 5.3242436 |
| Kurtosis | 24.38842 |
| Mean | 4206636.6 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 5.1368288 |
| Sum | 1.4556225 × 10¹¹ |
| Variance | 5.0163269 × 10¹⁴ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 12051 | 34.8% |
| 123456790 | 1179 | 3.4% |
| 3 | 732 | 2.1% |
| 4 | 732 | 2.1% |
| 5 | 657 | 1.9% |
| 6 | 653 | 1.9% |
| 7 | 645 | 1.9% |
| 519 | 476 | 1.4% |
| 8 | 466 | 1.3% |
| 9 | 449 | 1.3% |
| Other values (1442) | 16563 | 47.9% |
| Value | Count | Frequency (%) |
| 0 | 12051 | 34.8% |
| 1 | 16 | < 0.1% |
| 2 | 253 | 0.7% |
| 3 | 732 | 2.1% |
| 4 | 732 | 2.1% |
| 5 | 657 | 1.9% |
| 6 | 653 | 1.9% |
| 7 | 645 | 1.9% |
| 8 | 466 | 1.3% |
| 9 | 449 | 1.3% |
| Value | Count | Frequency (%) |
| 123456790 | 1179 | 3.4% |
| 19264 | 1 | < 0.1% |
| 18817 | 1 | < 0.1% |
| 17169 | 2 | < 0.1% |
| 17168 | 1 | < 0.1% |
| 17002 | 1 | < 0.1% |
| 16912 | 1 | < 0.1% |
| 16592 | 1 | < 0.1% |
| 15807 | 1 | < 0.1% |
| 15692 | 1 | < 0.1% |
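The mean of about 4.2 million sits far from the median of 11 because 1179 rows carry the value 123456790. That value looks like a placeholder for an unknown distance, but the report does not say so, so treat it as an assumption. Under that assumption, a rough sketch of the mean without those rows can be worked out from the published totals alone. The report rounds its figures to eight significant digits, so the result is approximate.

```python
# Figures copied from the tables above; the report rounds to 8 significant digits.
n_obs = 34603                 # observations (report overview)
total_sum = 1.4556225e11      # "Sum" row
placeholder = 123456790       # assumed placeholder value (not confirmed by the report)
placeholder_rows = 1179       # rows carrying that value

# Remove the placeholder rows' contribution from both the sum and the row count.
remaining_sum = total_sum - placeholder * placeholder_rows
remaining_rows = n_obs - placeholder_rows
mean_without_placeholder = remaining_sum / remaining_rows

print(f"rows kept: {remaining_rows}")
print(f"approximate mean without placeholder: {mean_without_placeholder:.0f}")
```

With these rounded inputs the mean comes out at roughly 200, which is far closer to the reported quantiles. On the raw data, replacing the placeholder with a missing value before profiling would give exact figures.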
Interactions
Correlations
| | age | db_code_commune | db_code_dpt | db_continent | db_lib_jour | db_week | dd_continent | dd_lib_jour | dd_week | distance | iso_commune | iso_date | iso_dpt | iso_pays | long_nom | nbre_prenoms | sexe |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | 0.147 | 0.149 | 0.083 | 0.013 | 0.001 | 0.059 | 0.047 | 0.196 | 0.324 | 0.180 | 0.192 | 0.239 | 0.429 | 0.033 | -0.100 | 0.126 |
| db_code_commune | 0.147 | 1.000 | 0.999 | 0.164 | 0.000 | -0.014 | 0.067 | 0.046 | -0.052 | 0.061 | 0.143 | 0.056 | 0.210 | 0.574 | 0.053 | -0.215 | 0.053 |
| db_code_dpt | 0.149 | 0.999 | 1.000 | 0.164 | 0.000 | -0.013 | 0.067 | 0.046 | -0.051 | 0.062 | 0.143 | 0.056 | 0.210 | 0.574 | 0.053 | -0.216 | 0.053 |
| db_continent | 0.083 | 0.164 | 0.164 | 1.000 | 0.009 | 0.017 | 0.386 | 0.015 | 0.014 | 0.103 | 0.045 | 0.008 | 0.032 | 0.252 | 0.000 | 0.068 | 0.027 |
| db_lib_jour | 0.013 | 0.000 | 0.000 | 0.009 | 1.000 | 0.015 | 0.000 | 0.047 | 0.007 | 0.004 | 0.015 | 0.018 | 0.009 | 0.000 | 0.000 | 0.003 | 0.000 |
| db_week | 0.001 | -0.014 | -0.013 | 0.017 | 0.015 | 1.000 | 0.006 | 0.007 | 0.199 | -0.006 | 0.018 | 0.009 | 0.017 | 0.023 | 0.005 | -0.009 | 0.022 |
| dd_continent | 0.059 | 0.067 | 0.067 | 0.386 | 0.000 | 0.006 | 1.000 | 0.000 | 0.009 | 0.013 | 0.047 | 0.000 | 0.074 | 0.207 | 0.011 | 0.023 | 0.020 |
| dd_lib_jour | 0.047 | 0.046 | 0.046 | 0.015 | 0.047 | 0.007 | 0.000 | 1.000 | 0.051 | 0.016 | 0.037 | 0.027 | 0.033 | 0.061 | 0.007 | 0.028 | 0.024 |
| dd_week | 0.196 | -0.052 | -0.051 | 0.014 | 0.007 | 0.199 | 0.009 | 0.051 | 1.000 | 0.061 | 0.061 | 0.035 | 0.058 | 0.083 | 0.004 | -0.034 | 0.021 |
| distance | 0.324 | 0.061 | 0.062 | 0.103 | 0.004 | -0.006 | 0.013 | 0.016 | 0.061 | 1.000 | 0.136 | 0.009 | 0.208 | 0.222 | -0.006 | -0.015 | 0.009 |
| iso_commune | 0.180 | 0.143 | 0.143 | 0.045 | 0.015 | 0.018 | 0.047 | 0.037 | 0.061 | 0.136 | 1.000 | 0.062 | 0.472 | 0.213 | 0.025 | 0.064 | 0.031 |
| iso_date | 0.192 | 0.056 | 0.056 | 0.008 | 0.018 | 0.009 | 0.000 | 0.027 | 0.035 | 0.009 | 0.062 | 1.000 | 0.061 | 0.030 | 0.008 | 0.062 | 0.010 |
| iso_dpt | 0.239 | 0.210 | 0.210 | 0.032 | 0.009 | 0.017 | 0.074 | 0.033 | 0.058 | 0.208 | 0.472 | 0.061 | 1.000 | 0.101 | 0.038 | 0.094 | 0.024 |
| iso_pays | 0.429 | 0.574 | 0.574 | 0.252 | 0.000 | 0.023 | 0.207 | 0.061 | 0.083 | 0.222 | 0.213 | 0.030 | 0.101 | 1.000 | 0.106 | 0.226 | 0.000 |
| long_nom | 0.033 | 0.053 | 0.053 | 0.000 | 0.000 | 0.005 | 0.011 | 0.007 | 0.004 | -0.006 | 0.025 | 0.008 | 0.038 | 0.106 | 1.000 | -0.055 | 0.000 |
| nbre_prenoms | -0.100 | -0.215 | -0.216 | 0.068 | 0.003 | -0.009 | 0.023 | 0.028 | -0.034 | -0.015 | 0.064 | 0.062 | 0.094 | 0.226 | -0.055 | 1.000 | 0.061 |
| sexe | 0.126 | 0.053 | 0.053 | 0.027 | 0.000 | 0.022 | 0.020 | 0.024 | 0.021 | 0.009 | 0.031 | 0.010 | 0.024 | 0.000 | 0.000 | 0.061 | 1.000 |
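The three "High correlation" alerts in the report overview can be traced back to this matrix. The sketch below copies a handful of coefficients from the table and lists the pairs whose absolute value reaches 0.5. The 0.5 cut-off is an assumption chosen because it reproduces the alerts; the report itself does not state the threshold, and the helper name `high_pairs` is illustrative.

```python
# A subset of the coefficients from the correlation table above.
corr = {
    ("db_code_commune", "db_code_dpt"): 0.999,
    ("db_code_commune", "iso_pays"): 0.574,
    ("db_code_dpt", "iso_pays"): 0.574,
    ("age", "iso_pays"): 0.429,
    ("iso_commune", "iso_dpt"): 0.472,
    ("db_continent", "dd_continent"): 0.386,
    ("age", "distance"): 0.324,
}

def high_pairs(coefficients, threshold=0.5):
    """Return pairs whose |coefficient| meets the threshold, strongest first."""
    hits = [(pair, r) for pair, r in coefficients.items() if abs(r) >= threshold]
    return sorted(hits, key=lambda item: -abs(item[1]))

for (left, right), r in high_pairs(corr):
    print(f"{left} ~ {right}: {r:.3f}")
```

Only the three pairs among db_code_commune, db_code_dpt and iso_pays pass the assumed cut-off, which matches the alerts. The next strongest pair in the table, iso_commune and iso_dpt at 0.472, falls just short.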
Missing values
Sample
| | nom | prenom | sexe | age | long_nom | nbre_prenoms | db_date | db_lib_jour | db_week | db_code_commune | db_lib_commune | db_code_dpt | db_dept_isocode_3166 | db_pays | db_continent | dd_date | dd_lib_jour | dd_week | dd_code_commune | dd_lib_commune | dd_code_dpt | dd_dept_isocode_3166 | dd_pays | dd_continent | iso_date | iso_dpt | iso_commune | iso_pays | distance |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | LETERME | PATRICE,EUGENE,OMER | M | 17 | 7 | 3 | 1953-04-20 | Lundi | 17 | 77043 | BOITRON | 77 | FR-77 | FRANCE | EUROPE | 1970-12-01 | Mardi | 49 | 75114 | PARIS-14E-ARRONDISSEMENT | 75 | FR-75 | FRANCE | EUROPE | 0 | 0 | 0 | 1 | 6.900000e+01 |
| 1 | DEROUT | JEAN,DANIEL | M | 23 | 6 | 2 | 1947-06-22 | Dimanche | 25 | 77053 | BRIE-COMTE-ROBERT | 77 | FR-77 | FRANCE | EUROPE | 1970-09-05 | Samedi | 36 | 89387 | SENS | 89 | FR-89 | FRANCE | EUROPE | 0 | 0 | 0 | 1 | 7.500000e+01 |
| 2 | GRAS Y PLASSARD | BORIS,GUY | M | 1 | 15 | 2 | 1969-06-03 | Mardi | 23 | 77055 | BROU-SUR-CHANTEREINE | 77 | FR-77 | FRANCE | EUROPE | 1970-11-06 | Vendredi | 45 | 77243 | LAGNY-SUR-MARNE | 77 | FR-77 | FRANCE | EUROPE | 0 | 1 | 0 | 1 | 5.000000e+00 |
| 3 | LENOIR | VALERIE,PAULE | F | 0 | 6 | 2 | 1970-03-14 | Samedi | 11 | 77055 | BROU-SUR-CHANTEREINE | 77 | FR-77 | FRANCE | EUROPE | 1970-03-14 | Samedi | 11 | 77055 | BROU-SUR-CHANTEREINE | 77 | FR-77 | FRANCE | EUROPE | 1 | 1 | 1 | 1 | 0.000000e+00 |
| 4 | LESZCZUK | MARIE-AGNES | F | 0 | 8 | 1 | 1969-11-01 | Samedi | 44 | 77066 | CERNEUX | 77 | FR-77 | FRANCE | EUROPE | 1970-01-19 | Lundi | 4 | 77305 | MONTEREAU-FAULT-YONNE | 77 | FR-77 | FRANCE | EUROPE | 0 | 1 | 0 | 1 | 4.500000e+01 |
| 5 | BOSCHETTI | CHANTAL,MARGUERITE,MADELEINE | F | 20 | 9 | 3 | 1950-10-10 | Mardi | 41 | 77071 | CHAINTREAUX | 77 | FR-77 | FRANCE | EUROPE | 1970-11-12 | Jeudi | 46 | 77387 | REMAUVILLE | 77 | FR-77 | FRANCE | EUROPE | 0 | 1 | 0 | 1 | 3.000000e+00 |
| 6 | LE CHARPENTIER | JEAN-LUC,GEORGES | M | 15 | 14 | 2 | 1955-03-18 | Vendredi | 11 | 77082 | CHAMPEAUX | 77 | FR-77 | FRANCE | EUROPE | 1970-07-30 | Jeudi | 31 | 77288 | MELUN | 77 | FR-77 | FRANCE | EUROPE | 0 | 1 | 0 | 1 | 1.200000e+01 |
| 7 | DEMELLIER | FRANCIS,MICHEL | M | 20 | 9 | 2 | 1950-02-05 | Dimanche | 5 | 77083 | CHAMPS-SUR-MARNE | 77 | FR-77 | FRANCE | EUROPE | 1970-08-17 | Lundi | 34 | 99109 | BERLIN | 99 | NaN | ALLEMAGNE | EUROPE | 0 | 0 | 0 | 0 | 8.650000e+02 |
| 8 | CADELLE | RENE,EDMOND,ALFRED | M | 24 | 7 | 3 | 1946-06-21 | Vendredi | 25 | 77092 | LA CHAPELLE-SUR-CRECY | 77 | FR-77 | FRANCE | EUROPE | 1970-12-16 | Mercredi | 51 | 77288 | MELUN | 77 | FR-77 | FRANCE | EUROPE | 0 | 1 | 0 | 1 | 1.234568e+08 |
| 9 | CHOPINET | BERNARD,DENIS | M | 19 | 8 | 2 | 1950-10-03 | Mardi | 40 | 77099 | CHATEAU-LANDON | 77 | FR-77 | FRANCE | EUROPE | 1970-03-01 | Dimanche | 9 | 45252 | PITHIVIERS | 45 | FR-45 | FRANCE | EUROPE | 0 | 0 | 0 | 1 | 3.400000e+01 |
| | nom | prenom | sexe | age | long_nom | nbre_prenoms | db_date | db_lib_jour | db_week | db_code_commune | db_lib_commune | db_code_dpt | db_dept_isocode_3166 | db_pays | db_continent | dd_date | dd_lib_jour | dd_week | dd_code_commune | dd_lib_commune | dd_code_dpt | dd_dept_isocode_3166 | dd_pays | dd_continent | iso_date | iso_dpt | iso_commune | iso_pays | distance |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 34593 | COUSIN DE MAUVAISIN | MIREILLE,PAULE | F | 19 | 19 | 2 | 1950-05-29 | Lundi | 22 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 1970-02-05 | Jeudi | 6 | 13001 | AIX-EN-PROVENCE | 13 | FR-13 | FRANCE | EUROPE | 0 | 1 | 0 | 1 | 26.0 |
| 34594 | GUERRI | GENEVIEVE,MARCELLE,FERNANDE,PHILOMENE | F | 20 | 6 | 4 | 1950-05-08 | Lundi | 19 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 1970-07-26 | Dimanche | 30 | 69238 | SAINT-SYMPHORIEN-SUR-COISE | 69 | FR-69 | FRANCE | EUROPE | 0 | 0 | 0 | 1 | 269.0 |
| 34595 | LEBARBENCHON | MONIQUE,CHANTAL,GENEVIEVE | F | 19 | 12 | 3 | 1950-05-06 | Samedi | 18 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 1970-03-05 | Jeudi | 10 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 0 | 1 | 1 | 1 | 0.0 |
| 34596 | BAUDRY | ERIC,JEAN-MARIE | M | 20 | 6 | 2 | 1950-07-21 | Vendredi | 29 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 1970-12-13 | Dimanche | 50 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 0 | 1 | 1 | 1 | 0.0 |
| 34597 | DE PASSORIO PEYSSARD | MICHEL,HENRI,MARCEL | M | 20 | 20 | 3 | 1950-07-16 | Dimanche | 28 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 1970-10-09 | Vendredi | 41 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 0 | 1 | 1 | 1 | 0.0 |
| 34598 | ALBERTINI | JACQUES,JOSEPH | M | 19 | 9 | 2 | 1950-10-08 | Dimanche | 40 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 1970-05-15 | Vendredi | 20 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 0 | 1 | 1 | 1 | 0.0 |
| 34599 | DI MARIA | ALAIN,LEON | M | 19 | 8 | 2 | 1950-11-21 | Mardi | 47 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 1970-09-14 | Lundi | 38 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 0 | 1 | 1 | 1 | 0.0 |
| 34600 | BREUZA | ROBERT,ROGER | M | 19 | 6 | 2 | 1950-12-15 | Vendredi | 50 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 1970-02-19 | Jeudi | 8 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 0 | 1 | 1 | 1 | 0.0 |
| 34601 | BERNARD | JEAN-PAUL,LOUIS,GILBERT | M | 19 | 7 | 3 | 1951-02-27 | Mardi | 9 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 1970-11-07 | Samedi | 45 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 0 | 1 | 1 | 1 | 0.0 |
| 34602 | BRUNET | JEAN,PAUL,MARIUS | M | 18 | 6 | 3 | 1951-02-28 | Mercredi | 9 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 1970-01-16 | Vendredi | 3 | 13055 | MARSEILLE | 13 | FR-13 | FRANCE | EUROPE | 0 | 1 | 1 | 1 | 0.0 |